A Novel Approach to Privacy Preserving Data Publishing Using Slicing Technique
Abstract
Several anonymization techniques, such as generalization and bucketization, have been designed for privacy-preserving microdata publishing. Recent work has shown that generalization loses a considerable amount of information, especially for high-dimensional data. Bucketization, on the other hand, does not prevent membership disclosure and does not apply to data that lack a clear separation between quasi-identifying attributes and sensitive attributes. In this paper, we present a novel technique called slicing, which partitions the data both horizontally and vertically. We show that slicing preserves better data utility than generalization and can be used for membership disclosure protection. Another important advantage of slicing is that it can handle high-dimensional data. We show how slicing can be used for attribute disclosure protection and develop an efficient algorithm for computing sliced data that obey the l-diversity requirement. Our workload experiments confirm that slicing preserves better utility than generalization and is more effective than bucketization in workloads involving the sensitive attribute. Our experiments also demonstrate that slicing can be used to prevent membership disclosure.
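The idea of partitioning a table both horizontally and vertically can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names `slice_table` and `is_distinct_l_diverse`, the sample records, the fixed-size bucketing, and the random permutation of each column group within a bucket. It uses the simplest "distinct" reading of l-diversity, and it is not the efficient algorithm the abstract refers to.

```python
import random


def slice_table(rows, column_groups, bucket_size, seed=0):
    """Slice a table (hypothetical helper, not the paper's algorithm).

    Horizontal partition: consecutive rows are grouped into buckets of
    `bucket_size` (an illustrative choice).
    Vertical partition: attributes are grouped by `column_groups`.
    Inside each bucket, the sub-tuples of every column group are shuffled
    independently, which hides the linkage between column groups.
    """
    rng = random.Random(seed)
    buckets = []
    for start in range(0, len(rows), bucket_size):
        bucket_rows = rows[start:start + bucket_size]
        bucket = []
        for group in column_groups:
            pieces = [tuple(row[attr] for attr in group) for row in bucket_rows]
            rng.shuffle(pieces)  # assumed: permute within the bucket
            bucket.append(pieces)
        buckets.append(bucket)
    return buckets


def is_distinct_l_diverse(buckets, group_index, l):
    """Check distinct l-diversity: every bucket must hold at least `l`
    different sensitive values. The sensitive attribute is assumed to be
    the last attribute of the column group at `group_index`."""
    return all(
        len({piece[-1] for piece in bucket[group_index]}) >= l
        for bucket in buckets
    )


# Hypothetical microdata, invented for this example only.
ROWS = [
    {"age": 22, "sex": "M", "zipcode": "47906", "disease": "dyspepsia"},
    {"age": 22, "sex": "F", "zipcode": "47906", "disease": "flu"},
    {"age": 33, "sex": "F", "zipcode": "47905", "disease": "flu"},
    {"age": 52, "sex": "F", "zipcode": "47905", "disease": "bronchitis"},
]
COLUMN_GROUPS = [("age", "sex"), ("zipcode", "disease")]

sliced = slice_table(ROWS, COLUMN_GROUPS, bucket_size=2)
for bucket in sliced:
    print(bucket)
print(is_distinct_l_diverse(sliced, group_index=1, l=2))
```

Keeping correlated attributes such as `zipcode` and `disease` in the same column group preserves their joint values, while the shuffle within each bucket breaks the link to the other column group. How to choose the column groups and buckets well is exactly what a real slicing algorithm has to decide, and this sketch leaves it out.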
Related resources
Efficient Techniques for Preserving Microdata Using Slicing
Privacy-preserving publishing is a family of techniques for applying privacy protection to vast amounts of collected data. One recent and prevailing problem lies in the field of data publication. The data often contain personally identifiable information, so releasing them raises a privacy problem. Several anonymization techniques such as generalization and bucketization have been designed for priva...
SMMCOA: Maintaining Multiple Correlations between Overlapped Attributes Using Slicing Technique
Knowledge discovery is one of the most discussed topics nowadays. Data collected from various sources is processed through several stages, and the output of this process is knowledge that was previously hidden. Basically, data mining is a technique whose output is previously unknown and potentially useful information extracted from data. The challenges of data mining include scalabili...
A New Privacy-Preserving Data Publishing Method Aimed at Improving Classification Accuracy on Anonymized Data
The growth of electronic services has made data collection and storage easier and has led public and private organizations to record vast amounts of personal information in their databases. These records often include sensitive personal information (such as income and diseases) that must be protected from access by others. But in some cases, mining the data and extraction of knowledge from thes...
Publishing High-Dimensional Micro Data Using Anonymization Technique
Society is seeing rapid growth in the number and variety of data collections containing person-specific information as network connectivity, computer technology, and disk storage become increasingly affordable. Large databases are in widespread use today. The large amount of data available means that a lot of information about individuals can be learned from public data. While ...
Slicing : A Efficient Method For Privacy Preservation In Data Publishing
In this paper we propose and prove a new technique called “Overlapping Slicing” for privacy preservation of high-dimensional data. Publishing data on the web faces many challenges today. The data usually contain personal information that can identify individuals, which poses a privacy problem. Privacy is an important issue in data publishing. Many orga...